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VASCULAR ENDOTHELIAL GROWTH FACTOR-E 



The present invention is concerned with a novel 
vascular endothelial growth factor (VEGF) herein 
designated "VEGF-E", and characterisation of the 
nucleic acid and amino acid sequences of VEGF-E. 

Angiogenesis involves formation and proliferation of 
new blood vessels, and is an essential physiological 
process for normal growth and development of tissues 
in, for example, embryonic development, tissue 
regeneration and organ and tissue repair . 
Angiogenesis also features in the growth of human 
cancers which require continuous stimulation of blood 
vessel growth. Abnormal angiogenesis is associated 
with other diseases such as rheumatoid arthritis and 
psoriasis. 

Capillary vessels consist of endothelial cells which 
carry the genetic information necessary to proliferate 
to form capillary networks. Angiogenic molecules 
which can initiate this process have previously been 
characterised. A highly selective mitogen for 
vascular endothelial cells is vascular endothelial 
growth factor (VEGF) (Ferrara et al . , "Vascular 
Endothelial Growth Factor: Basic Biology and Clinical 
Implications". Regulation of angiogenesis, by I.D. 
Goldberg and E.M. Rosen 19 97 Bikhanser Vertag 
Basle/^witzerland) . VEGF is a potent vasoactive 
protein which is comprised of a glycosylated cationic 
46-49 kd dimer having two 24 kd subunits. It is 
inactivated by sulfhydryl reducing agents and is 
resistant to acidic pH and to heating and binds to 
immobilised heparin. 
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VEGF has four different forms of 121, 165, 189 and 206 
amino acids due to alternative splicing. VEGF121 and 
VEGF165 are soluble and are capable of promoting 
angiogenesis, whereas VEGF189 and VEGF206 are bound to 
5 heparin containing proteoglycans in the cell surface. 
The temporal and spatial expression of VEGF has been 
correlated with physiological proliferation of the 
blood vessels (Gajdusek, CM., and Carbon, S.J., Cell 
Physiol., 139:570-579, (1989)); McNeil, P.L., 

10 Muthukrishnan, L. , Warder, E., D'Amore, P. A. , J. Cell. 
Biol., 109:811-822, (1989)). Its high affinity 
binding sites are localized only on endothelial cells 
in tissue sections (Jakeman, L.B. , et al . , Clin. 
Invest. 89:244-253, (1989)). The growth factor can be 

15 isolated from pituitary cells and several tumor cell 
lines, and has been implicated in some human gliomas 
(Plate, K.H. Nature 359:845-848, (1992)). The 
inhibition of VEGF function by ant i -VEGF monoclonal 
antibodies was shown to inhibit tumor growth in 

20 immune-deficient mice (Kim, K.J., Nature 362:841-844, 
(1993) ) . 

The present inventors have now identified a further 
vascular endothelial growth factor, designated herein 
25 as "VEGF-E", and the nucleic acid sequence encoding 

it, which has potentially significant benefits for the 
treatment of tumours. 

Therefore, according to a first aspect of the present 

FT 

30 invention there is provided a nucleic acid molecule 
encoding a VEGF-E protein or a functional equvalent, 
derivative or bioprecursor thereof, said protein 
comprising the amino acid sequence illustrated in 
Figure 2 or 4 . Preferably, the nucleic acid molecule 

35 is a DNA and even more preferably a cDNA molecule. 
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Also provided by this aspect of the present invention 
is a nucleic acid molecule such as an antisense 
molecule capable of hybridising to the nucleic acid 
molecules according to the invention under high 
5 stringency conditions. 

Stringency of hybridisation as used herein refers to 
conditions under which polynucleic acids are stable. 
The stability of hybrids is reflected in the melting 
10 temperature (Tm) of the hybrids. Tm can be 
approximated by the formula: 

81. 5°C+16. 6 (log 10 [Na + ]+0, 41 (%G&C) -6001/1 

15 wherein 1 is the length of the hybrids in nucleotides. 
Tm decreases approximately by 1-1. 5 °C with every 1% 
decrease in sequence homology. 

The nucleic acid capable of hybridising to nucleic 
20 acid molecules according to the invention will 

generally be at least 70%, preferably at least 80 or 
90% and more preferably at least 95% homologous to the 
nucleotide sequences according to the invention. 

25 The present invention also comprises within its scope 
proteins or polypeptides encoded by the nucleic acid 
molecules according to the invention or a functional 
equivalent, derivative or bioprecursor thereof. 

3 0 Therefore, according to a further aspect of the 

present invention, there is provided a VEGF-E protein, 
or a functional equivalent, derivative or bioprecursor 
thereof, having an amino acid sequence as illustrated 
in Figure 2 or 4 . A further aspect of the invention 

35 comprises a VEGF-E protein, or a functional 
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equivalent, derivative or bioprecursor thereof, 
encoded by a nucleic acid molecule according to the 
invention. Preferably, the VEGF-E protein encoded by 
said nucleic acid molecule comprises an amino acid 
5 sequence as illustrated in Figure 2 or 4 . 

The DNA molecules according to the invention may, 
advantageously, be included in a suitable expression 
vector to express VEGF-E encoded therefrom in a 
10 suitable host. 

An expression vector according to the invention 
includes a vector having a nucleic acid according to 
the invention operably linked to regulatory sequences, 

15 such as promoter regions, that are capable of 

effecting expression of said DNA fragments. The term 
"operably linked" refers to a juxta position wherein 
the components described are in a relationship 
permitting them to function in their intended manner. 

20 Such vectors may be transformed into a suitable host 
cell to provide for expression of a polypeptide 
according to the invention. Thus, in a further 
aspect, the invention provides a process for preparing 
polypeptides according to the invention which 

25 comprises cultivating a host cell, transformed or 
transfected with an expression vector as described 
above under conditions to provide for expression by 
the vector of a coding sequence encoding the 
polypeptides, and recovering the expressed 

3 0 polypeptides. 

The vectors may be, for example, plasmid, virus or 
phage vectors provided- with an origin of replication, 
optionally a promoter for the expression of said 
3 5 nucleotide and optionally a regulator of the promoter. 
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The vectors may contain one or more selectable 
markers, such as, for example, ampicillin resistance. 

Regulatory elements required for expression include 
5 promoter sequences to bind RNA polymerase and 

transcription initiation sequences for ribosome 
binding. For example, a bacterial expression vector 
may include a promoter such as the lac promoter and 
for transcription initiation the Shine-Dalgarno 

10 sequence and the start codon AUG. Similarly, a 
eukaryotic expression vector may include a 
heterologous or homologous promoter for RNA polymerase 
II, a downstream polyadenylation signal, the start 
codon AUG, and a termination codon for detachment of 

15 the ribosome. Such vectors may be obtained 

commercially or assembled from the sequences described 
by methods well known in the art. 

Nucleic acid molecules according to the invention may 
20 be inserted into the vectors described in an antisense 
orientation in order to provide for the production of 
antisense RNA. Antisense RNA or other antisense 
nucleic acids may be produced by synthetic means. 

25 In accordance with the present invention, a defined 
nucleic acid includes not only the identical nucleic 
acid but also any minor base variations including in 
particular, substitutions in bases which result in a 
synonymous . codon (a different codon specifying the 

30 same amino acid residue) due to the degenerate code in 
conservative amino acid substitutions. The term 
"nucleic acid sequence" also includes the 
complementary sequence to any single stranded sequence 
given regarding base variations. 



35 
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The present invention also advantageously provides 
nucleic acid sequences of at least approximately 10 
contiguous nucleotides of a nucleic acid according to 
the invention and preferably from 10 to 50 
5 nucleotides. These sequences may, advantageously be 
used as probes or primers to initiate replication, or 
the like. Such nucleic acid sequences may be produced 
according to techniques well known in the art, such as 
by recombinant or synthetic means. They may also be 

10 used in diagnostic kits or the like for detecting the 
presence of a nucleic acid according to the invention. 
These tests generally comprise contacting the probe 
with the sample under hybridising conditions and 
detecting for the presence of any duplex or triplex 

15 formation between the probe and any nucleic acid in 
the sample. 

The nucleic acid sequences according to this aspect of 
the present invention comprises the sequences of 
20 nucleotides designated herein as VEGFE 1-10, 
illustrated in Figure 5. 

According to the present invention these probes may be 
anchored to a solid support. Preferably, they are 

25 present on an array so that multiple probes can 
simultaneously hybridize to a single biological 
sample. The probes can be spotted onto the array or 
synthesised in situ on the array. (See Lockhart et 
al . , Nature Biotechnology, vol. 14, December 1996 

3 0 "Expression monitoring by hybridisation to high 

density oligonucleotide arrays". A single array can 
contain more than 100, 500 or even 1,000 different 
probes in discrete locations. 

35 The nucleic acid sequences, according to the invention 



may be produced using such recombinant or synthetic 
means, such as for example using PCR cloning 
mechanisms which generally involve making a pair of 
primers, which may be from approximately 10 to 50 
nucleotides to a region of the gene which is desired 
to be cloned, bringing the primers into contact with 
mRNA, cDNA, or genomic DNA from a human cell, 
performing a polymerase chain reaction under 
conditions which bring about amplification of the 
desired region, isolating the amplified region or 
fragment and recovering the amplified DNA, Generally, 
such techniques as defined herein are well known in 
the art, such as described in Sambrook et al 
(Molecular Cloning: a Laboratory Manual, 1989) . 

The nucleic acids or oligonucleotides according to the 
invention may carry a revealing label. Suitable 
labels include radioisotopes such as 32 P or 35 S, enzyme 
labels or other protein labels such as biotin or 
fluorescent markers. Such labels may be added to the 
nucleic acids or oligonucleotides of the invention and 
may be detected using known techniques per se. 

The protein according to the invention includes all 
possible amino acid variants encoded by the nucleic 
acid molecule according to the invention including a 
polypeptide encoded by said molecule and having 
conservative amino acid changes. Proteins or 
polypeptides according to the invention further 
include variants of such sequences, including 
naturally occurring allelic variants which are 
substantially homologous to said proteins or 
polypeptides. In this context, substantial homology 
is regarded as a sequence which has at least 70%, 
preferably 80 or 90% amino acid homology with the 



proteins or polypeptides encoded by the nucleic acid 
molecules according to the invention. 

The nucleic acid or protein according to the invention 
may be used as a medicament or in the preparation of a 
medicament for treating cancer or other* diseases or 
conditions associated with expression of VEGF-E 
protein. 

Advantageously^ the nucleic acid molecule or the 
protein according to the invention may be provided in 
a pharmaceutical composition together with a 
pharmacologically acceptable carrier, diluent or 
excipient therefor. 

The present' invention is further directed to 
inhibiting VEGF2 in vivo by the use of antisense 
technology, Antisense technology can be used to 
control gene expression through triple-helix formation 
or antisense DNA or RNA, both of which methods are 
based on binding of a polynucleotide to DNA or RNA . 
For example, the 5' coding portion of the mature 
protein sequence, which encodes for the protein of the 
present invention, is used to design an antisense RNA 
oligonucleotide of from 10 to 40 base pairs in length. 
A DNA oligonucleotide is designed to be complementary 
to a region of the gene involved in transcription 
(triple-helix - see Lee et al . Nucl. Acids Res., 
6:3073^(1979); Cooney et al . , Science, 241:456 (1988); 
and Dervan et al . , Science, 251: 1360 (1991), thereby 
preventing transcription and the production of VEGF2 . 
The antisense RNA oligonucleotide hybridises to the 
mRNA in vivo and blocks translation of an mRNA 
molecule into the VEGF2 (antisense - Okano, J. 
Neurochem., 56:560 (1991); Oligodeoxynucleotides as 
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Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). 

Alternatively, the oligonucleotide described above can 
5 be delivered to cells by procedures in the art such 
that the anti-sense RNA or DNA may be expressed in 
vivo to inhibit production of VEGF-E in the manner 
described above. 

10 Antisense constructs to VEGF-E, therefore, may inhibit 
the angiogenic activity of the VEGF-E and prevent the 
further growth or even regress solid tumours, since 
angiogenesis and neovascularization are essential 
steps in solid tumour growth* These antisense 

15 constructs may also be used to treat rheumatoid 

arthritis, psoriasis and diabetic retinopathy which 
are all characterized by abnormal angiogenesis. 

A further, aspect of the invention provides a host cell 
20 or organism, transformed or transfected with an 

expression vector according to the invention. The 
host cell or organism may advantageously be used in a 
method of producing VEGF-E, which comprises recovering 
any expressed VEGF-E from the host or organism 
25 transformed or transfected with the expression vector. 

According to a further aspect of the invention there 
is also provided a transgenic cell, tissue or organism 
comprising a transgene capable of expressing VEGF-E 

30 protein according to the invention. The term 

"transgene capable of expression" as used herein means 
a suitable nucleic acid sequence which leads to 
expression of VEGF-E or proteins having the same 
function and/or activity. The transgene, may include, 

35 for example, genomic nucleic acid isolated from human 
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cells or synthetic nucleic acid, including DNA 
integrated into the genome or in an extrachromosomal 
state. Preferably, the transgene comprises the 
nucleic acid sequence encoding the proteins according 
5 to the invention as described herein, or a functional 
fragment of said nucleic acid. A functional fragment 
of said nucleic acid should be taken to mean a 
fragment of the gene comprising said nucleic acid 
coding for the proteins according to the invention or 

10 a functional equivalent, derivative or a non- 
functional derivative such as a dominant negative 
mutant, or bioprecursor of said proteins. For 
example, it would be readily apparent to persons 
skilled in the art that nucleotide substitutions or 

15 deletions may be used using routine techniques, which 
do not affect the protein sequence encoded by said 
nucleic acid, or which encode a functional protein 
according to the invention. 

20 VEGF-E protein expressed by said transgenic cell, 
tissue or organism or a functional equivalent or 
bioprecursor of said protein also form part of the 
present invention. 

25 Antibodies to the protein or polypeptide of the 

present invention may, advantageously, be prepared by 
techniques which are known in the art. For example, 
polyclonal antibodies may be prepared by inoculating a 
host animal, such as a mouse, with the polypeptide 

30 according to the invention or an epitope thereof and 
recovering immune serum. Monoclonal antibodies may be 
prepared according to known techniques such as 
described by Kohler R. and Milstein C, Nature (1975) 
256, 495-497. 
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Antibodies according to the invention may also be used 
in a method of detecting for the presence of a 
polypeptide according to the invention, which method 
comprises reacting the antibody with a sample and 
identifying any protein bound to said antibody. A kit 
may also be provided for performing said method which 
comprises an antibody according to the invention and 
means for reacting the antibody with said sample. 

Proteins which interact with the polypeptide of the 
invention may be identified by investigating protein- 
protein interactions using the two-hybrid vector 
system first proposed by Chien et al (1991) . 

This technique is based on functional reconstitution 
in vivo of a transcription factor which activates a 
reporter gene. More particularly the technique 
comprises providing an appropriate host cell with a 
DNA construct comprising a reporter gene under the 
control of a promoter regulated by a transcription 
factor having a DNA binding domain and an activating 
domain, expressing in the host cell a first hybrid DNA 
sequence encoding a first fusion of a fragment or all 
of a nucleic acid sequence according to the invention 
and either said DNA binding domain or said activating 
domain of the transcription factor, expressing in the 
host at least one second hybrid DNA sequence, such as 
a library or the like, encoding putative binding 
proteins to be investigated together with the DNA 
binding or activating domain of the transcription 
factor which is not incorporated in the first fusion; 
detecting any binding of the proteins to be 
investigated with a protein according to the invention 
by detecting for the presence of any reporter gene 
product in the host cell; optionally isolating second 
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hybrid DNA sequences encoding the binding protein. 

An example of such a technique utilises the GAL4 
protein in yeast. GAL4 is a transcriptional activator 
of galactose metabolism in yeast and has a separate 
domain for binding to activators upstream of the 
galactose metabolising genes as well as a protein 
binding domain. Nucleotide vectors may be 
constructed, one of which comprises the nucleotide 
residues encoding the DNA binding domain of GAL4 . 
These binding domain residues may be fused to a known 
protein encoding sequence, such as for example the 
nucleic acids according to the invention. The other 
vector comprises the residues encoding the protein 
binding domain of GAL4 . These residues are fused to 
residues encoding a test protein. Any interaction 
between polypeptides encoded by the nucleic acid 
according to the invention and the protein to be 
tested leads to transcriptional activation of a 
reporter molecule in a GAL-4 transcription deficient 
yeast cell into which the vectors have been 
transformed. Preferably, a reporter molecule such as 
6-galactosidase is activated upon restoration of 
transcription of the yeast galactose metabolism genes. 

Advantageously, the antibody according to the 
invention may also be used as a medicament or in the 
preparation of a medicament for treating tumours or 
other cjiseases associated with expression of VEGF-E. 
The invention also further provides a pharmaceutical 
composition comprising said antibody together with a 
pharmaceutically acceptable carrier diluent or 
excipient therefor . 

A further aspect of the present invention also 
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provides a method of identifying VEGF-E in a sample, 
which method comprises contacting said sample with an 
antibody according to the invention and monitoring for 
any hybridisation of any proteins to said antibody. A 
5 kit for identifying the presence of VEGF-E in a sample 
is also provided comprising an antibody according to 
the invention and means for contacting said antibody 
with said sample. 

10 The invention may be more clearly understood with 

reference to the accompanying example, which is purely 
exemplary, with reference to the accompanying 
drawings, wherein: 

15 Figure l: is a nucleotide sequence coding for a 

partial VEGF-E protein according to the 
invention. 

Figure 2: is an illustration of amino acid sequence of 
20 the nucleic acid sequence of Figure 1. 

Figure 3: is an illustration of a nucleotide sequence 
encoding VEGF-E protein according to the 
invention. 

25 

Figure 4:. is an illustration of the amino acid 

sequence of the nucleic acid sequence of 
Figure 3 . 

30 Figure 5: depicts the nucleic acid sequences of the 

first 18 human EST clones obtained from the 
BLAST search of the LifSeq™ database. 



Figure 6: 

35 



depicts the nucleotide sequences of 50 human 
EST clones obtained from the proprietary 



LifeSeq database. . . 

Figure 7: is an illustration of the nucleotide 

sequences utilised as primers to identify 
the sequence of the gene coding for VEGF-E. 

EXAMPLE 1 

A BLAST (Basic Local Alignment Search Tool; Altschul 
et al . , 1990 J. Mol. Biol* 215, 403-410) search was 
performed in the propriety LifeSeq™ human EST database 
(Incyte Pharmaceuticals, Inc., Palo Alto, CA, USA). 
BLAST produces alignments of both nucleotide and amino 
acid sequences to determine sequence similarity. 
Because of the local nature of the alignments, BLAST 
is especially useful in determining exact matches or 
in identifying homologues. While it is useful for 
matches which do not contain gaps, it is inappropriate 
for performing mot if -style searching. The fundamental 
unit of BLAST algorithm output is the High-scoring 
Segment Pair (HSP) . 

Eighteen human EST clones (Figure 5) with high 
similarity to the previously identified VEGF proteins 
were identified and a further fifty EST clones (Figure 
6) were identified using these sequences as query 
sequences, allowing us. to deduce the putative sequence 
for the new VEGF-E protein. The sequences obtained 
were compared to known sequences to determine regions 
of homology and to identify the sequence as a novel 
VEGF-E protein. Using the DNA sequence information in 
the databases we were able to prepare suitable primers 
having the sequences of VEGFE 1-10 illustrated in 
Figure 7 for use in subsequent RACE experiments to 
obtain the complete DNA sequence for the VEGF-E gene. 
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CLAIMS 

1. A nucleic acid molecule encoding a VEGF-E protein 
or a functional equivalent derivative or bioprecursor 

5 thereof, said protein comprising the amino acid 
sequence illustrated in Figures 2 or 4 . 

2. A nucleic acid molecule according to claim 1 
wherein said nucleic acid is a DNA molecule. 

10 

3. A nucleic acid molecule according to claim 1 or 2 
wherein said nucleic acid is a cDNA molecule. 

4. A nucleic acid molecule according to any of 
15 claims 1 to 3 comprising the nucleotide sequence 

illustrated in Figure 1 or 3 . 

5. A nucleic acid molecule capable of hybridising to 
a molecule according to any of claims 1 to 4 under 

20 high stringency conditions. 

6. A VEGF-E protein, or a functional equivalent, 
derivative or bioprecursor thereof, having the amino 
acid sequence illustrated in Figure 2 or 4 . 

25 

1. A VEGF-E protein, or a functional equivalent, 
derivative or bioprecursor thereof, encoded by a 
nucleic acid molecule according to any of claims 1 to 

4 * ~ . 

30 

8. A protein according to claim 7, which comprises 
the amino acid sequence illustrated in Figure 2 or 4 . 

9. An expression vector comprising a nucleic acid 
35 molecule according to any of claims 1 to 4 . 
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10. An expression vector according to claim 9 further 
comprising a nucleotide sequence encoding a reporter 
molecule, 

5 11. A nucleic acid molecule according to any of 
claims 1 to 5 for use as a medicament . 

12. Use of a nucleic acid molecule according to any 
of claims 1 to 5 in the preparation of a medicament 
for inhibiting angiogenic activity and formation and 
proliferation of new blood vessels, growth and 
development of tissues, tissue regeneration and organ 
and tissue repair or for treating cancer or rheumatoid 
arthritis or psoriasis or diabetic retinopathy. 

13. A pharmaceutical composition comprising a nucleic 
acid molecule or a protein according to any of claims 
1 to 5 or 6 to 8 respectively, together with a 
pharmaceutical ly acceptable carrier, diluent or 
excipient therefor . 

14. A host cell or organism transformed or 
transfected with an expression vector according to 
claim 9 or 10. 

15. A transgenic cell, tissue or organism comprising 
a transgene capable of expressing a VEGF-E protein 
according to any of claims 6 to 8 . 

30 16. A process for producing a VEGF-E protein 

according to any of claims 6 to 8, said process 
comprising transforming a host cell or organism with 
an expression vector according to claim 9 and 10, and 
recovering the expressed protein from said host cell 

35 or organism. 



10 



15 



20 
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17. An antibody capable of binding to a protein 
according to any of claims 6 to 8, which is preferably 
a monoclonal antibody. 

5 18. An antibody according to claim 17 for use as a 
medicament. 

19. Use of an antibody according to claim 17 in the 
preparation of a medicament for inhibiting angiogenic 

10 activity and formation and proliferation of new blood 
vessels, growth and development of tissues, tissue 
regeneration and organ and tissue repair or for 
treating cancer or rheumatoid arthritis or psoriasis 
or diabetic retinopathy. 

15 

20. A pharmaceutical composition comprising an 
antibody according to claim 17 together with a 
pharmaceutically acceptable carrier diluent or 

20 excipient therefor. 

21. A method of identifying VEGF-E in a sample which 
• method comprises contacting said sample with an 

antibody according to claim 17 and monitoring for 
25 binding of any protein to said antibody. 

22. A kit for identifying the presence of VEGF-E in a 
sample which comprises an antibody according to claim 
17 and means for contacting said antibody with said 

30 sample.^ 

23. A method of identifying compounds which inhibit 
angiogenesis which method comprises providing a host 

35 cell or organism according to claim 14 or a transgenic 



cell, tissue or organism according to claim 15, 
contacting a test compound with said cell, tissue or 
organism and monitoring for the presence or absence 
either of said reporter molecule or VEGF-E. 

24. A compound identifiable according to the method 
of claim 23. 

25. A compound according to claim 24 for use as a 
medicament. 

26. Use of a compound according to claim 24 in the 
preparation of a medicament for inhibiting angiogenic 
activity and formation and proliferation of new blood 
vessels, growth and development of tissues, tissue 
regeneration and organ and tissue repair or for 
treating cancer, rheumatoid arthritis, psoriasis or 
diabetic retinopathy. 

27.. A nucleic acid sequence comprising the nucleotide 
sequence of any of the sequences identified in Figure 
6 or 7 . 

28. An expression vector comprising a nucleic acid 
sequence according to claim 27. 

29. A host cell transformed or transfected with an 
expression vector according to claim 28. 

30. A method for producing a polypeptide, said method 
comprising the steps of: 

a) culturing the host cell of claim 29 under 
conditions suitable for expression of the 
peptide; and 

b) recovering the polypeptide from the host 
cell culture . 
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„3 M NIFLLNLLT22VRLY 

j - 

1 AGGAAATCAA ATTAGGATAA GATTTGTAXC TGATGAATAT TTTCCTTC TG AACCTTCTAA CAGAGGAGGT AAGATTATAC 
TCCTTTAGTT TAATCCTATT CTAAACATAG ACT ACT TATA AAAGGAAGAC TTGGAAGATT GTCTCCTCCA TTCTAATATG 



+3 


S C T I 


? R N ? 3 V S TREE LKR TDT IFWP GCL 


81 


AGCTGCACAC 
TCGACGTGTG 


CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC 
GAGCATTGAA GAGTCACAGG TATTCCCTTC T'TGATTTCTC TTGGCTATGG TAAAAGACCG 


CAGGTTCTCT 
GTCCAACAGA 


-2 








+3 


L V K 


RCGG NCA CCL HNCM ECQ CVP 


S K V 


161 


CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA ATGAATGTCA ATGTGTCCCA AGCAAAGTTA 
GGACCAATTT GCGACACCAC CCTTGACACG CACAACAGAG GTGTTAACGT TACTTACAGT TACACAGGGT TCGTTTCAAT 


-2 








+J 


T K K Y 


HEV LQLE ? K T G V R GLHK SLT 


D V A 






VSG DCT NHSP TWP 


241 


CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC CGGTGTCAGG GGATTGCACA AATCACTCAC 
GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG GCCACAGTCC CCTAACGTGT TTAGTGAGTG 


CGACGTGGCC 
GCTGCACCGG 


-2 








+3 


LEHH EEC DCV C R G S TGG 




♦2 




V Q P. E H R R IAAS PPA 


ALA 


+1 


W S T 


M R S V TVC AEG AQED SRI T TS 


s s c 


321 


CTGGAGCACC 
GACCTCGTGG 


ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC 
TACTCCTCAC ACTGACACAC ACGTCTCCCT CGTGTCCTCC TATCGGCGTA GTGGTGGTCG 


AGCTCTTGCC 
TCGAGAACGG 


+2 


QSCA VQW L I L LETJV CV I SIL N'LSC L L Q 


+ 1 


P E L C 


SAV ADSI RER M R Y LHP 




401 


CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT CTCCATCCTT AATCTCAGTT 
GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA GAGGTAGGAA TTAGAGTCAA 


GTTTGCTTCA 
CAAACGAAGT 


♦2 


G P F 


I F R I YSA F 




481 


AGGACCTTTC 
TCCTGGAAAG 


ATCTTCAGGA TTTACAGTGC ATTCTGAAAG AGGAGACATC AAACAGAATT AGGAGTTGTG 
TAGAAGTCCT AAATGTCACG TAAGACTTTC TCCTCTGTAG TTTGTCTTAA TCCTCAACAC 


CAACAGCTCT 
GTTGTCGAGA 


561 


TTTCAGAGGA 
AAACTCTCCT 


GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG AAAATTAAAT GTTGTATTAA ATAGATCACC 
CCOGATTTCC TGTCCTCTTT TCCAGAAGTT AGCACCTTTC TTTTAATTTA CAACATAATT TATCTAGTGG 


641 


AGCTAGTTTC 
TCGATCAAAG 


AGAGTTACCA TGTACGTATT CCACTAGCTG GGTTCTGTAT TTCAGTTCTT TCGATACGGC 
TCTCAATGGT ACATGCATAA GGTGATCGAC CCAAGACATA AAGTCAAGAA AGCTATGCCG 


TTAGGGTAAT 
AATCCCATTA 


721 


GTCAGTACAG 
CAGTCATGTC 


GAAAAAAACT GTGCAAGTGA GCACCTGATT CCGTTGCCTT GGCTTAACTC TAAAGCTCCA 
CTTTTTTTGA CACGTTCACT CGTGGACTAA GGCAACGGAA CCGAATTGAG ATTTCGAGGT 


TGTCCTGGGC 
ACAGGACCCG 


8C1 


CTAAAATCGT 
GATTTTAGCA 


ATAAAATCTG GA 
TATTTTAGAC CT 





fa 1 
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1 


MNIFLLNIiLT 


EEVRLYSCTP SNFSVSIRSE LKRTJDTIFWF 


GCLLVKRCGG 


51 


NCACCLHNCN 


ECQCVKKVT KKYKEYLQLR PKJTGVRGLHK 


3LTDVALEHH 


101 


EECDCVCRGS 


TOO 
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+3 


MNI F L L KLLT E E V 


R L Y 


1 


AGGAAATCAA ATTAGGATAA GATTTGTATC TGATGAATAT T7TCCTTCTG AACCTTCTAA 
m/*v"TTTTkr , Tai mt&frrnTaTT PTM&ariT^ af TJVrTTATA AAAGGAAGAC TTCtGAAGAT'"! 1 


CAGAGGAGGT 
GTCTCCTCCA 


AAGATTATAC 
TTCTAATATG 


+3 


SCTP RNP SVS IR2E LKR TDT 


I F W P CCL 


81 


AGCTGCACAC CTCGTAACTT CTCAGTG7CC ATAAGGGAAG AACTAAAGAG AACCGATACC 
TCGACGTGTG GAGCATTGAA GAGTCACAGG TATTCCCTTC TTGATTTCTC TTGGCTATGG 


ATTTTCTGGC 
TAAAAGACCG 


CAGGTTGTCT 
GTCCAACAC-A 


-2 








+3 


LVK RCGG N C A CCL H N C N ECQ 


C V P 


$ K V 


161 


CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA ATGAATGTCA 
GGACCAATTT GCGACACCAC CCTTGACACG GACAACAC-AG GTGTTAACGT TACTTACAGT 


ATGTGTCCCA 
TACACAGGGT 


AGCAAAGTTA 
TCGTTTCAAT 


-2 










TKKY HEV LQLR PKT GVR GLHK SLT 


D V A 


+ 1 


V S G D C T 


N H S P T W P 


241 


CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC CGGTGTCAGG GGATTGCACA 
GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG GCCACAGTCC CCTAACGTGT 


AATCACTCAC 
TTAGTGAGTG 


CGACGTGGCC 
GCTGCACCGG 


-2 








+3 


LEKH EEC DCV C R G S TGG 






+2 


V QRE MRR IAAS PPA 


ALA 


+1 


WST MRSV TVC AEG AQED SRI 


T T S 


s s c 


321 


CTGGAGCACC ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC 
GACCTCGTGG TACTCCTCAC ACTGACACAC ACGTCTCCCT CGTGTCCTCC TATCGGCGTA GTGGTGGTCG 


AGCTCTTGCC 
TCGAGAACOG 


t-2 


QSCA VQW LIL LENV CVI SIL 


N L S C L L Q 


+1 


PELC SAV ADSI RER MRY LHP 






401 


CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT CTCCATCCTT 
GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA GAGGTAGGAA 


AATCTCAGTT 
TTAGAGTCAA 


GTTTGCTTCA 
CAAACGAAGT 


+2 


GPF IFRI YSA F 






481 


ACGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG AGGAGACATC AAACAGAATT AGGAGTTGTG 
TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAASACTTTC TCCTCTGTAG TTTGTCTTAA TCCTCAACAC 


CAACAGCTCT 
GTTCTCGAGA 


561 


TTTGAGAGGA GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG AAAATTAAAT 
AAACTCTCCT CCGGATTTCC TGTCCTCTTT TCCAQAAGTT AGCACCTTTC TTTTAATTTA 


GTTGTATTAA 
CAACATAATT 


ATAGATCACC 
TATCTAGTGG 


641 


AGCTAGTTTC AGAGTTACCA TGTACGTATT CCACTAGCTG GGTTCTGTAT TTCAGTTCTT 
TCGATCAAAG TCTCAATGGT ACATGCATAA GGTGATCGAC CCAAGACATA AAGTCAAGAA 


TCGATACGGC 
AGCTATGCCG 


TTAGGGTAAT 
AATCCCATTA 


721 


GTCAGTACAG GAAAAAAACT GTGCAAGTGA GCACCTGATT CCGTTGCCTT GGCTTAACTC 
CAGTCATGTC CTTTTTTTGA CACGTTCACT CGTGGACTAA GGCAACGGAA CCGAATTGAG 


TAAAGCTCCA 
ATTTCGAGGT 


TGTCCTGGGC 
ACAGGACCCG 


eoi 


CTAAAATCGT ATAAAATCTC- GATTTTTTTK TTTTTTTTTG CGCATATTCA CATATGTAAA 
GATTTTAGCA TATTTTAGAC CTAAAAAAAN AAAAAAAAAC GCGTATAAGT GTATACATTT 


CCAGAACATT 
GGTCTTGTAA 


CTATGTACTA 
GATACATGAT 


881 


CAAACCTGGT TTTTAAAAAG GAACTATGTT GCTATGAATT AAACTTGTCT CGTGCTGATA GGACAGACTG GATTTTTCAT 
GTTTGGACCA AAAATTTTTC CTTGATACAA CGATACTTAA TTTGAACACA GCACGACTAT CCTGTCTGAC CTAAAAAGTA 


-3 
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9 61 ATTTCTTATT AAAATTCCTG CCATTTAGAA GAAGAGAACT ACATTCATGG TTTGGAAGAQ ATAAACCTGA AAAGAAGAQT 
TAAAGAATAA TTTTAAAGAC GGTAAATCTT CTTCTCTTGA TGTAAGTACC AAACCTTCTC TATTTGGACT TTTCTTCTCA 
-3 



1041 GGCCTTATCT TCACTTTATC GATAAGTCAG TTTATTTGTT TCATTCTGTA CATTTTTATA TTCTCCTTTT GACATTATAA 
CCGGAATACA AGTGAAATAG CTATTCAGTC AAATAAACAA AGTAACACA? GTAAAAATAT AAGAGGAAAA CTGTAATATT 
-3 1 



1121 CTGTTGGCTT TTCTAATCTT GTTAAATATA TCTATTTTTA CCAAAGGTAT TTAATATTCT TTTTTATGAC AACTTAGATC 
GACAACCGAA AAGATTAGAA CAATTTATAT AGATAAAAAT GGTTTCCATA AA7TATAAGA AAAAATACTG TTGAATCTAG 



1201 AACTATTTTT AGCTTGGTAA ATTTTTCTAA ACACAATTGT TATAGCCAGA GC AACAAAGA TGATATAAAA TATTGTTGCT 
7TGATAAAAA TCGAACCATT TAAAAAGATT TCTGTTAACA ATATCGGTCT CCTTGTTTCT ACTATATTTT ATAACAACGA 



1281 CTGACAAAAA TACATGTATT TCATTCTCGT ATGGTGCTAG AGTTAGATTA ATCTGCATTT TAAAAAACTG AATTGGAATA 
GACTGTTTTT ATGTACATAA AGTAAGAGCA TACCACGATC TCAATCTAAT TAGACGTAAA ATTTTTTGAC TTAACCTTAT 



1361 GAATTGGTAA GTTGCAAAGA CTTTTTGAAA ATAATTAAAT TATCATATCT TCCATTCCTG TTATTGGAGA TGAAAATAAA 
CTTAACCATT CAACGTTTCT GAAAAACTTT TATTAATTTA ATAGTATAGA AGGTAAGGAC AATAACCTCT ACTTTTATTT 



1441 AAGCAACTTA TGAAAGTAGA CATTCAGATC CAGCCATTAC TAACCTATTC CTTTTTTGGG GAAATCTGAG CCTAGCTCAG 
TTCGTTGAAT ACTTTCATCT GTAAGTCTAG GTCGGTAATG ATTGGATAAG GAAAAAACCC CTTTAGACTC GGATCGAGTC 



1521 AAAAACATAA AGCACCTTGA AAAAGACTTG GCAGCTTCCT GATAAAGCGT GCTGTGCTGT GCAGTAGGAA CACATCCTAT 
TTTTTGTATT TCGTGGAACT TTTTCTGAAC CGTCGAAGGA CTATTTCGCA CGACACGACA CGTCATCCTT GTGTAGGATA 



1601 TTATTGTGAT GTTGTGGTTT TATTATCTTA AACTCTGTTC CATACACTTG TATAAATACA TGGATATTTT TATGTACAGA 
AATAACACTA CAACACCAAA ATAATAGAAT TTGAGACAAG GTATGTGAAC ATATTTATGT ACCXATAAAA ATACATGTCT 



1581 AGTATGTCTC TTAACCAGTT CACTTATTGT ACCTGG 
TCATACAGAG AATTGGTCAA GTGAATAACA TGGACC 



17/12/98 19:00:10 Translation of putative human VEGF (long contig) i p age ± 



1 MNIFiLNLLT EEVRLY5CTP KMFSV5IREE LKRTDTTFWF 3CIjLVK?.CGG 



51 NCACCLHNCN ECQCVP5KV? KKYHEVLQLR FKTGVRGLKK 



101 EECI3CVCRGS TGG 



^ >3993I80H1 LUNGNON03 INCYTE 

CACAf-'^CACTCACCGACGTGGCCCl'GGAGCACCATGAGGNGTGTaACTGTGTGTGCAGAGGGAGCXCAGGAGGATAGCC 
j GCATC jCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCAT 

4 CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAO 

5 AATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCTAAAGGACAGGAGAANAGGTCTT 

6 >3 510192K1 CONCNOTQ1 INCYTE 

7 XGCAGTCCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTT 

8 TCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAG 

9 GAiSGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTT 
1 C TCAGAGTTACCATGTACGTATTCC ACTAGCTGGGTTCTGTATTT 

11 >2559870H1 ADRETUT01 INCYTE 

12 CACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCA 

13 TGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGGGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGC 

14 AGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTXATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCA 

15 TCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGA 

16 >3979767Kl LUNGTUT08 INCYTE 

17 GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 

18 GTTATCTCCATCCTTAATCTCAG'TTGTTTGCTTCAAGGACCTTTCATCTrCAGGATTTACAGTGCATTCTGAAAGAGGAG 

19 ACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTC7TCAATCGTG 

20 GAAAGAANATTAAATGTTGTATTAAATAGACACCAGCT 

21 >3 980 011H1 LUNGTUT08 INCYTE 

22 GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 

23 GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACATGCATTCTGAAAGAGGAGA 

24 CATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTC 

25 AAAGAAAATTAAATGTTGTATTAAATAGATCACCA 

26 >4825396H1 BLADDXT01 INCYTE 

27 GAGAACCGATACCATTTTCTGGCCAGGTTG7CTCCTGGTTAAACGCTC 

2 8 GC AATGAATGTCAATGTGTCCC AAGC AAAGTTACTAAAAAATACCACGAGGTC CTTC AGTTGAGACCAAAGACCGGTGTC 

29 AGGGGATTGCACAAATCACTCACCGACGTGGCCCTC^AGCACCATGAGGAGTGTCACTGTGTGTGCAGAGGGAGCACAGG 

30 AGGATAGCCGCATCACCACCA 

31 >3 07370 3H1 BONEUNT01 INCYTE 

32 AGAAAATCC AGAGTGG TGG ATC TGAACC TTCTAAC AG AGGAGGT AAG AT TATAC AGCTGC AC ACC TCG TAAC TTC TC AGT 

33 GTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACT 

34 GTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAA 

35 TTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCA 

36 >1302516H1 PLACNOT02 INCYTE 

37 AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCT'GAACCTTCTAACAGAGGAGGTAAGATTATAC 

38 AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 

39 CCTGGTTAAACGCTCTGGTGGGAACTGTOCCTC^ 

40 ACTAAAAAATACCACGAGGTCC 

41 >3684109H1 HEAANOT01 INCYTE 

42 ATTTCATCTTCAGGATTTACAGTGCATTCTGAA^ 

4 3 GAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAANAAAAT^ 

44 GTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAG 
C 5 TACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTT 

46 >4713188H1 BRAIHCTOl INCYTE 

47 CAAAGTTACTAAAAJVATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCA 

48 ACGTGGCCCTGGAGCACCATGAGGAGTG-TGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGCAG 

49 CTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGT 

50 TTGCT 

51 >458823Hl KERANCT01 INCYTE 

52 ANGAGTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCT^ 

53 GTTTGOTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTG 

54 CAACAGCTCTTTTGAGAGGAGGCCTAAATCNCAGGAGAAAAW^ 

55 ATAGATC 

56 >1303 909H1 PLACNOT02 INCYTE 

57 AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCTTCTAACAGAGGAGGTAAGATTATAC 

58 AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAJ^GAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 

59 CCTGGTTAAACGCTGTGGT^GGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAG 

60 >2739211H1 OVAHNOT09 INCYTE 

61 GTGCATTCTGAAftGAGGAGACATCAAACAGAATTAGGAG TTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGA 

62 GAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACG 

63 TATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCT^ 

64 GTGAGCACCTGAT 

65 >3325591Hl FTHYNOT03 INCYTE 

66 tGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTKjGAAAGAAAATTAa^ 

67 AAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACG 

68 GCTTAGGGTAATGTCAGTACAGGAAAAJ^CTGTGCAAGTGAGCArc 

69 ATGTCNNGGGCNAAAANCGAAAAAT 

70 >3733 565H1 SMCCNOS01 INCYTE 

71 CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGNAAGANGAGACATCAAACAG 
7 2 AATTAGGNGTTGTGCAAAAGCTCTTTTGAGATCAGGCCTAAAGGACAGG 

7 3 AAATGTTGTATNAAATNGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNCNGTATTCAGTCT 

74 TTCGGAACGGCTTAGGGTAATGTCAGTACAGGANAAAAACTGTGCAGTGAG ^ 

75 >3554223H1 SYNONOT01 INCYTE £ /€ > £ 



ACGGC^AGGGTMTGTCAGTACAGG 

. CTCd '/TCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTO 

79 ACATTCTATGTACNACAAACCTGGTTTTTAAAAA6GA.\C 

80 >4507477B1 OVARTDT0 1 INCYTS 

81 GGCTAGTTTCAGAGTTACCATCTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTC'I'TTCGA'rACGGCTTAGGGTAAT 

82 GTCMTACAGGAAAAAAACTGTGCAAGTGAGCACCTC^ 

83 TAAAATCGTATAAAATCTGGA 

84 >4163 578H1 aRSTMOT32 INCYTE 

85 AATAGATCACCAGCTAGTTTCAGAGTXACCATGTACGTATTCCACTAGCTGGGNTCTGTATTTCAGTTCCTTTCGATACG 
36 GCTTAGGGTAATGTCAGTACAGGAAAAAAGCTGTGCAAGTGA-GCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCC 
87 ATGTCCTGOGCCTAAAATCCTATA 



1 >2 0SC5?5H1 BfifiWUTui iwlmx 

JU^GG^CTATGTTGCTATGAATTAAACTTGTGTCGTGCTGA 

TCTGC TTTAGAAGAAGAGAACTACATTCATCKSTTTGGAAGAGATAAACCTCAAAAGA^ 

4 tATCGAT'AAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCT^ - 

5 TCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTTTTA 

6 >3993180Hl LONGNOM03 INCYTE 

7 CACAAATCACTC/kCCGACGTGGCCCTGGAGCACCATGAGGNGTGTC-ACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCC 

8 GCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTG^^ 

9 C cttaatctcagttgtttgcttcaaggacctttcatcttcaggatttacagtgcattctgaaagaggagacatcaaacag 

1 o aat?aggagttgtgcaacagctcttttgagaggaggctaaaggacaggagaanaggtctt 

11 >3510192H1 CONCNOTOI INCYTE 

12 tgcagtgcagtcgctgattctattagagaacgtatgcgttatctccatccttaatctcagttgtttgcttcaaggacctt 

13 tcatcttcaotatttacagtgcattctgaaagaggagacatcaaacagaattaggagttgtgcaacagctcttttgagag 

14 gaggcctaaaggacaggagaaaaggtcttcaatcgtggaaagaaaattaaatgttgtattaaatagatcaccagctagtt 

1 5 tcagagttaccatgtacgtattccactagctgggttctgtattt 

16 >4164633Hl BRSTNOT32 INCYTE 

17 CT TGTTAAATATATC?ATTTTTACCAAAGGT^^ 

18 GTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATG 

19 x^TTTCATTCTCGTATGGTGCTACAOTTAGATTAATCTGCATTTTAAAAAACTGAATOGGAATA 

2 0 AAGACTTTTTGANAATAATTAAATTATCATA'PCTTCC ATTCCTGTTATTGGGGGAGAAAAT 

21 >2559670H1 ADRETUT 0 1 INCYTE 

22 CACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCA 

23 TGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGGGGATAGCCGCA'PCACCACCAGCAGCTCTTGCCCAGAGCTGTGC 

24 AGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCA 
2 5 TCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGA 

26 >3817470H1 BONSTUT01 INCYTE 

27 TTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAA 

28 AATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTC 

2 9 ACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTT'TGACATTATAACTGTTGGC TTTC 

3 0 TAATCTGTTAAATATATCTATTTTTACC AAAGGTATTTAATATTCTTT 

31 >3979767H1 LUNGTUT08 INCYT2 

32 GGAGG AT AGCCGCATCACC AC CAGCAGCTCTTGCCC AG AGCTGTGCAGTGCAGTGGCTGATTCTATT AG AG AACGTATGC 
3 3 GTTATCTCCATCCTTAATCTCAGTTCTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAG 

34 ACATO^CAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCMTCGTG 

35 GAAAGAANATTAAATGTTGTATTAAATAGACACCAGCT 
35 >3980011H1 LUNGTUT08 INCYTE 

37 GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAC-AGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 

38 GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACATGCATTCTGAAAGAGGAGA 

39 CATCAAJVCAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGG 

40 AAAGAAAATTAAATGTTGTATTAAATAGATCACCA 

41 >482 5396HI BLADDIT01 INCYTE 

42 GAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCvACAATT 

43 GCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTC 

44 AGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGG 

45 AGGATAGCCGCATCACCACCA 

45 >3073703H1 BONEUNT01 INCYTE 

47 AGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCTCAGT 

48 GTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACT 

49 GTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACC 

50 TTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCA 

51 >362169H1 BRAITUT03 INCYTE 

52 AGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCA 

5 3 TTTTAAAAAACTGAATTGG AATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAATTATC ATATCTTCC ATTC 

54 CrcTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTT 

55 GGGGAAATCTGAGCCTAGC 

56 >4201385H1 BRAITUT29 INCYTE mf ^™* m 

57 TTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTAT 

58 TAAAATTTCTCCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAA 

59 TCACTrTATCGATAAGTC^TTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTGACATATAACTGTTGGCTTTT 
6 0 CTAATCTGTTAAATATATCTATTTTTACCAAAGGTATTTAATAT 

€1 >1302S16H1 PLACNOT02 INCYTE 

6 2 AGGAAATC AAATTAGGATAAGATT^GTATCTGATGAATATTTTCCTTCTG AACC TTCTAAC AG AGGAGGTAAGATTAT AC 

63 AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 

64 CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTT 

65 ACTAAAAAATAC CACGAGGTCC 

66 >3684109H1 HEAANOT01 INCYTE ,^ m „^* 

67 ATTTCATCTTCAGGATTTACAGTGCATTCTGAAANAGGAGaAATCAAACANAATTAGGAGTTGTGCAACAGCTCTTTTGA 

68 GAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAANAAAATTA^ 

69 GTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAG 

7 0 TACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTT 

71 >2549720H1 LUNGTUT06 INCYTE 

72 TTAGCTTGGNAAATTTTTCTAAACACAATTGTTATAGCCA^ 

73 AATACATGTATTTCATTCTCGTATGGTGCTAGAGTO 

7 4 AAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAA 
7 5 TATGANAGTAG f^i <f 4> 



76 >877279H1 LUNGAST01 INCYTE 

■ CTTT?*TATGACAACTTAGATCAACTATTTTTAG 
GATG ^TAAAATATTGTTGCTCTGA0AAAAArACAT3TATTTCAT / TCTCGTATGG7GCTAGAGTTAGATTAATCTGCAT 
,79 TTTAAAAAACTGAATTV3GAATAGAATTGGTAAGTTCCAAAG0CTTTTTG 
80 TGTTATTGGNGG 

61 >4 713163H1 BRAIKCT01 1 INCYTE 

82 CAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCG 

83 AC G TG GCC CTGGAGC AC C ATG AGGAGTGTG AC TGTGTGTGC AG AGGGAGC AC AG GAGGATAGCCGCATCACC AC CAGC AG 

84 CTCTTGCCCAGAGCTGTGCAOTGCAGTGGCTGATTCTATTAGAGAACGTArGCGTTATCTCCATCCTTAATCTCAGTTGT 

85 TTGCT 

86 >2171082Kl ENDCNOTC3 INCYTE 

87 AGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATTOTGTACATTTTTA 

88 TATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATC^ 

89 CTTTTTTATGACAACTTAGATCAACTATTTT 

90 GATGA 

91 >875860H1 LUTJGA3T0L INCYTE 

92 CTGGATTTTTCATATTTCTTATTAAAATTTCTWCATC^ 

93 TGAAAAGAAGAGTCGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCA7TGTGTACAT 

94 TTTGACATTATAACTGTTGGCTTTTC TAATCTTGTTAAATATArCTATTTTTACCAAAGGTATTTAATATTCTTTTTTAT 

95 GAC 

96 >7C616QH1 SYNOKAT04 INCYTE 

97 GCTCATAI^CACATATGTAAACCAGAACATTCTATC 

98 TAAACTTGTGTCGTGCTGATAGGACAGACTCGATTTTTCATA^ 

99 TACATrcATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCAKTTTATCGATAAGTCAGTTTATTTGT 

100 TTCA 

101 >458S23H1 KERANOT01 INCYTE 

102 ANGAGTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTT 

103 GTTTGNTTC AAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATC AAACAGAATTAGGAGTTGTG 

104 CAACAGCTCTTTTGAGAGGAGGCCTAAAGGNCAGGAGAAAAGGTCTTCAATCGTGGA^^ 

105 ATAGATC 

106 >538436H1 LNCDNOTC2 INCYTE 

107 AAAGATGATATAAAATATTCTTGCTCTGACAAAAAra 

108 C ATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTC 

109 TCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTAT 

110 >13O3909Kl PLACNOT02 INCYTE 

111 AGGAAATC AAATT AGS AT AAGATTTGTATCTGA TGAATATTTTCCTTCTGAACCTTCTAAC AGAGGAGGTAAGATTATAC 

112 AGCTGCACACCTCGTAAC TTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 

113 CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCMTGAATGTCAATGTGTCCCAAG 

114 >2739211H1 OVARNOT09 INCYTE 

115 G TGC AT TC TGAAAGAG GAG AC ATC AAAC AG AATT AGG AG TTG T G C AAC AGC TCTTT TG AG AGG AGGC C T AAAGG AC AGG A 
115 GAAJUVGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACG 

117 TATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGAT.^CGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAA 

118 GTGAGCACCTGAT 

119 >255C343Hl LUNGTUT06 INCYTE 

120 TGTACATTTTTATATTCTCCTTTTGACATTATAAC^ 

121 GTATTTAATATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGC 

122 CAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATrCTCGTATCGTGCTA 

123 >5321149H1 FIBFFEN26 INCYTE 

124 CAC7\ATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTCCTCTGNCAAAAATACATC 

12 5 TGGTGCTAGAGTTAGATTAATCTGCA-TTTTAAAAAACTGAATT^ 

126 TAATTAAATTATC ATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAAATTCAGATCCAC 

127 CATTACTAAC 

128 >37949 5H1 THYRNCTG2 INCYTE 

125 ATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATC 

130 AGACTTTTTGAAAATAATTAAATTATC ATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGT 

131 AGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCT 

13 2 TGAAAAA 

13 3 >332S5 91H1 PTHYNOT03 INCYTE 

134 TGCAAC AGCTCTTTTGAG^GAGGCCTAA^GGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATT 
13 5 AAATAGATC ACC AGCTAGTTTCAGAGTTACCATGTACGTATTCC ACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACG 

136 GCTTACGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGA.TTCCGTTGCCTTGCTTAACCCTAAAGCNCC 

137 ATGTCNWGGGCWAAAANCGAAAAAT 

13 8 >543 890H1 OVAKNOT02 INCYTE 

139 TTTCT AAACAC AATTGTT ATAGCCAGAGGAAC AAAGATGAT ATAAAATATT3TTGCTCTGACAAAAATACATGTATTTC A 

140 TTCTCGTATGGTGCTA3AGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGNATAGAATTGGTAAGTTGCAAAGNCTT 

141 TTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAG 

142 ACATTCAGATC 

143 >3733565H1 SMCCNOS01 INCYTE 

144 CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTC^ 

14 5 AATTAGGNGTTGTGCAAAAGCTCTTTTGAGAGGAGGCC7AAAGGACAGGAGAA 

14 6 AAATGTTGTATNAAATNGATCACCAGCTAGTTTCAGAGTTACCATGTACGTAO^CCACTAGCTGGGNCNGTATTCAGTCT 
14 7 TTCGG AACGGC TTAGGG TAATG TCAG TACAGGANAAAAAC TG TG CAG TGAG 
143 >4641939H1 PRCSTMTC3 INCYTE 

14 9 GTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGC TATGAATTAAACTTGTGTCCATGCTGATAGGACAGACTGGAT 

15 0 TTTNCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAG 



151 GAAGAGTGGCCTTATC'i ,, ii:AC'l"rrATC'tiATAaLiTUM^l iTA'i "iT'o'i "rru/i lij'avi^n^ A.iAArtAx>A*w*s.^.-.* v ~w~ * 
: ATAAC^TGGCTTT 

1_ >2 0G' '0H1 TESTNOT03 INCYTE 

154 TTATATTCTCCTTTTGACATTATAACTGTTGGCTT^ 

155 ATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTT^ 
155 AAAGATGATATAAAATATTGTTGCTCTGANAAAAATACATGTAT 

157 >3085331H1 HEAONOT03 INCYTE 

158 GCTCATATTCACATATCTAAACCAGAACATTCT 

155 AAAC'TTGTGTCGTGCTGATAGCACAGACTGGNTTTTTCATATTTCTTATrTANAATTTCTGCCATTAGAAGAAGAGAACT^ 
L 6 0 CATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTATTTC ACTTTATCGATAAGTC AGT 

161 >3414043H1 PTHYNOT04 INCYTE 

162 GCTC ATATTCACATATGTAAACCAGAACATTCTATGTACTAC AAACCTOOTTTTTAAAAAGGAACrATGTTGCTATG AAT 

163 TAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTCTC 

164 TACATTCATGGTTTGGAAGAGATAAACCTGAAA 

165 >370 5963H1 PENCNOT07 INCYTE 

166 ANACTCTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAMTCGTATAAAA 

167 TCTGGArmrmnnnririnnnimiinnnGCTCATATTCACATATGTAAACCA 

1 6 S AAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTC 

169 TTCTCCCATTAGAAGAAGAGAACTACNTTCANGGTTTGGAAGAGATAACCCTGAAAAGANGGG 

170 >5137051K1 OVARDIT04 INCYTE 

171 AAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTNTTTC 

172 TATTGGAGATGAANATAAAAAGCAACTTATGAAAGTAGACATTCAGA^ 

173 AAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTNTGTCA 

174 GTAGGAACACATCCTATTTATTGTGATGNTGTGGTTTATTAT 

175 >3554223H1 SYNONCT01 INCYTE 

176 ATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCC ACTAGCTGGGTTCTGTATTTC AGTTCTTTCGAT 

177 ACGGCOTAGGGTAATOTCAGra 

178 CTCCATGTC CTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTNTTTTTTTTTGCGC ATATTC AC ATATGT AAACCAGN 

179 AC ATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAAC 

190 >4507477H1 OVARTDT01 INCYTE 

1 B 1 GGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCrGTATTTC AGTTCTTTCGATACGGCTTAGGGTAAT 

182 GTCAGTACAGGAAAAAAACTOTGCAAGTGAGCACCTGATTCCGTTGCCTTGC 

183 TAAAATCGTATAAAATCTGGA 

1B4 >1955646H1 CONNNOT01 INCYTE 

185 TGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGC 

186 AACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAA^ 

187 ACATAAAGC ACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGGAAC ACATCCTATTTA 

188 TTGTG ATGTTGTGGTTTATATCCTAAACC 

189 >4163378H1 BRSTNOT3 2 INCYTE 
1 SC AATAGATCACCAGCTAGTTTCAGAGTTACCATC^^ 

191 GCTTAGGGTAATGTCAGTACAGGAAAAAAGCTGTCCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAAC 

192 ATGTCCTGGGCCTAAAATCGTATA 

193 >5095141H1 EPIMNON05 INCYTE 

194 AGATAAACCTGAAAAGAAGAGTGGCCTrATNTTCACTTTATCGATAAGTCAGNTTATTTGTTTC 

195 TATTCTCCTTTTGACATTATAACTGNTGGCTTTTCT 

196 CTTT 

197 >943 82 6H1 ADRENOT03 INCYTE 

198 TATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAA 

199 AATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATG 

2 00 >34512 73H1 UTRSNON03 INCYTE 

2 01 TTTTTTNTTTTGCTCATATTCAC ATATGTAAACCNGAACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAACTATG 
2 02 TTGCTATGAATTAAACTTGTGTCGTGCTGATAGGAC AGACT'OGATTTTTCANATTTCTTANTAANNTTTCTGCC ATTTAG 
2 03 AAGA 

2 04 >1402278H1 LATRTUT02 INCYTE 

205 GTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAA 
2 06 ATCGTATAAAATCTGGAnnnnnnnnnnnnn^^ 

2 07 CCTGGTTTTT AAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTC ATATTT 

208 CTTA 

209 >4361191H1 SK£KNOT01 INCYTE 

210 GCAAAGACTTTTTGANAATNATTAANTTATCATATCTTCCATTCCTGTTATNGGAGATGANAATAAAAAGCAACTTATGA 

211 AAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCNCAGAAAAACATAAAGC 

212 ACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGC-AACACATCCNATTTATTGTGNTGTN 

213 GNGGTTTTATGATC 

214 >1307017H1 PLACNOT02 INCYTE 

215 TGTCAGTACAGGAAAAAAACTGTGCAAGTCAGCACCTGATTCCGTTGCCTTOCTTAACTCTAAAGCTCCATGTCC7<3GGC 

2 1 6 CTAAAATCGTATAAAATCTGGArmnnnnrmnnniinnn 

217 AC AAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGGAC AGACTGGATTTTTCA 
213 TAT 

219 >5032225H1 HEARFET03 INCYTE 

2 2 0 AATTATCATATCTTCCATTCCTGTTATTGGAGATGNAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCAT 

221 TACTAACCTATrcCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTGTCAGCTTC 

222 CTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGT'rGTGGTTTTATTATCTTAAACTCGTT 

22 3 CCAT 

224 >3732621H1 SMCCNOS01 INCYTE . 

22 5 ANAGATGATATAAAANATTGTTGCTCTG AC AANNATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTG 



2'"' >3530274H1 BLADNOT09 INCYTE 

TTCC-'TCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATT 
72 * CCTT 'TGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCG 

230 TGCTGTGCTGTC-CAGTAGGAACAC ATCCTATTT ATTGTGATGTTGTGGTTTTATTATCTAAACTCTGTTCC ATAC ACTTG 

231 TATAAATACATGGATATTTTTATC-TACAGAAGTATGTCTCTTAACCAGTTCA 
">\7 >3530249Hl BLADNOT09 INCYTE 

7^1 C^TCCATTCCTGTTATTGGAGAI^GAAAATAAAAAGCAACTTATGANAGTAGACATTCAGATCCAGCCATTACTAACCTAT 
7 3 4 tCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGG-CAGCTTCCTGATAAAGC 

235 gt^ctgtgctgtgcagtaggaacacatcgtattta.ttgtgatgttgtggttttattatcttaaactctgttcca'tacact 
23 6 tgtataaatac atggatatttttatgtacagaagtatgtctcttaaccagtt-cacttattgtacctcg 

237 



VEGFE1 


AA AATGT ATGG AT AC A ACTT A C 


22 


VEGFE2 


GTTTGATGAAAG A I'll GGGCTTG 


23 


VEGFE3 


TTTCTAAAGGAAATCAAATTAG 


22 


VEGFE4 


GATAAGATTTGTATCTGATG 


20 


VEGFE5 


GATGTCTCCTCTTTCAG 


17 


VEGFE6 


GCACAACTCCTAATTCTG 


18 


VEGFE7 


AGCACCTGATTCCGTTGC 


19 


VEGFE8 


TAGTACATAGAATGTTCTGG 


20 


VEGFE9 


AAGAGACATACTTCTGTAC 


19 


VEGFE10 


CCAGGTACAATAAGTGAACTG 


21 



Fig. 7 
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Budapest Treaty on the International Recognition of the Dep sit of Microorganisms for 

the Purp s s of Pat nt Procedur 

Rec ipt in the case of an original deposit issued pursuant to Rul 7.1 by the 
International Depositary Authority BCCM™/LMBP identified at the bottom of next page 



International Form BCCM™/LMBP/BP/4/99-23 



To : Name of the depositor : Janssen Pharmaceutica N.V. 



Address 



: Turnhoutseweg 30 
B-2340 Beerse 
Belgium 



Identification of the microorganism: 

1.1 Identification reference given by the depositor: 



VEGF-X CUB PET22b 



1.2 Accession number given by the International Depositary Authority: 



LMBP 3991 
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II. 



Scientific description and/ r proposed tax n mic designation 



The microorganism identified under I above was accompanied by: 



(mark with a cross the applicable box(es)) 



- a scientific description 



yes [3 



no □ 



- a proposed taxonomic designation 



yes Q 



no [X] 



III. Receipt and acceptance 

This International Depositary Authority accepts the microorganism identified under I 
above, which was received by it on (date of original deposit) : December 20, 1 999 



IV. International Depositary Authority 

Belgian Coordinated Collections of Microorganisms (BCCM™) 

Laboratorium voor Moleculaire Biologie - Plasmidencollectie (LMBP) 

Universiteit Gent 

K.L. Ledeganckstraat 35 

B-9000 Gent, Belgium 

Signature(s) of person (s) having the power to represent the International Depositary 
Authority or of authorized official(s): 




Date : January 12, 2000 



Martine Vanhoucke 
BCCM/LMBP curator 



